Enhancing Accuracy for Protein Prediction Secondary Structure by a New Hybrid Method
نویسندگان
چکیده
Prediction of protein secondary structure is an important step on the way to spell out its three dimensional structure and its function. This paper describes a new technique for prediction of secondary structure of protein based on contemporary machine learning methodology and data mining approach. More than one method has been developed to predict the protein secondary structure from the amino acids sequence; these methods show that we can achieve accuracy up to 80%. The work in this research is consists of three parts. In the first part, the secondary structure of each amino acid is predict alone with naive bays classifier, this method is based on amino acid preferences for different secondary structure. In the second part, an evolutionary algorithm to ameliorate this prediction is used; this method is based on physicochemical properties of protein regions. In the last part, a fragments bank which contains the protein fragments frequently detected in the Protein Data Bank (PDB) was developed; this method is based on the sequence alignment of protein but with a reduced database. The results of this research shows that the proposed method is improved the best know predictive accuracy by 4.5%, and attaint 85% accuracy with different datasets.
منابع مشابه
Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملImproved Protein Secondary Structure Prediction using a Intelligent HSVM Method with a New Encoding Scheme
Prediction of protein secondary structures is an important problem in bioinformatics and has many applications. Successful secondary structure predictions provide a starting point for direct tertiary structure modelling, and also can significantly improve sequence analysis and sequence-structure threading for aiding in structure and function determination. Now many secondary structure predictio...
متن کاملA knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence
Motivation: In our previous approach, we proposed a hybrid method for protein secondary structure prediction, called HYPROSP, which combined our proposed knowledge-based prediction algorithm PROSP and PSIPRED. The knowledge base constructed for PROSP contains small peptides together with their secondary structural information. The hybrid strategy of HYPROSP uses a global quantitative measure, m...
متن کاملA Hybrid Method for Protein Secondary Structure Prediction
Protein secondary structure can be used to help determine the tertiary structure via the fold recognition. Predicting the secondary structure from the protein sequence has attracted the attention of many researchers. Support Vector Machine (SVM) is a new learning algorithm based on statistical learning theory that has been successfully applied to the protein secondary structure prediction probl...
متن کاملA New Hybrid Framework for Filter based Feature Selection using Information Gain and Symmetric Uncertainty (TECHNICAL NOTE)
Feature selection is a pre-processing technique used for eliminating the irrelevant and redundant features which results in enhancing the performance of the classifiers. When a dataset contains more irrelevant and redundant features, it fails to increase the accuracy and also reduces the performance of the classifiers. To avoid them, this paper presents a new hybrid feature selection method usi...
متن کامل